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ACOUSTIC SHOCK PREVENTION 
BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

The present invention relates to preventing acoustic shock in voice 
5 communication systems. 

2. Background Art 

Exposure to noise can cause various health problems including 
permanent hearing loss, temporary hearing loss, tinnitus, headaches, depression, 
increased sensitivity to loud noises, and the like. The risk of injury and the extent 

10 of any resulting injury are functions of the intensity of noise levels, the length of 
exposure, the type of noise, and the sensitivity level of the person exposed. 
Acoustic shock can occur due to a sudden or unexpected noise or due to prolonged 
exposure to noise. Various types of workers are susceptible to injury from acoustic 
shock, including telephone operators, telemarketers, technical support personnel, 

15 and the like, who spend a great many hours using a telephone. Other activities that 
expose a person to prolonged sound sources may also be susceptible. 

A telephone near-end user can experience unpleasant sounds resulting 
in acoustic shock when an outgoing call inadvertently connects with a fax machine, 
modem, or similar device. Acoustic shock can also occur when a far-end user hits 
20 a telephone button, generating a DTMF signal. Other sources of acoustic shock can 
include noise coupled into a telecommunication system from natural or man-made 
sources, failure of telecommunications equipment, and the like. 

Different standards exist for limiting acoustic pressure in headsets and 
handsets. The North American A- weighted curves are based on a 90 dBA eight 
25 hour time weighted average free field noise exposure limit. In other words, a user 
cannot be subjected to more than 90 dBA noise level during an eight hour period. 
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The CERN standard permits an 85 dBA exposure limit over eight hours. Both 
standards are measured at the ear reference point (ERP). Both standards permit 
higher noise levels for shorter periods of time. In calculating exposure, acoustic 
pressure values may be corrected according to several factors including non- 
5 occupational exposure, whether the noise was received from a headset or a handset, 
signal bandwidth, and the like. 

What is needed is to prevent acoustic shock, as much as possible, 
without excessively inhibiting desirable signal components such as speech, dial 
tones, ring tones, and the like. Acoustic shock should be prevented without the use 
10 of a computationally intensive algorithm, permitting inexpensive implementation 
within handsets, headsets, and other telecommunications equipment. 

SUMMARY OF THE INVENTION 

The present invention provides a shriek rejecter which attenuates an 
input signal based on tones detected through frequency domain analysis. 

15 A system for preventing acoustic shock is provided. The system 

includes an attenuator for attenuating an input signal by a variable amount. A 
frequency analyzer converts a time windowed portion of the input signal into 
frequency bins. An energy calculator determines a relative energy signal having an 
element corresponding to each frequency bin. Each element of the relative energy 

20 signal is based on energy magnitude in the corresponding frequency bin relative to 
a total energy of the time windowed portion of the input signal. Each of a plurality 
of signal detectors generates a detection signal for modifying the variable attenuation 
amount based on the relative energy signal. At least one of the detection signals 
includes a vector of attenuation elements, each attenuation element corresponding 

25 to one of the frequency bins. Attenuation logic determines an attenuation value 
signal controlling the variable attenuation amount based on the detection signals. 

In an embodiment of the present invention, at least one of the signal 
detection signals, when asserted, inhibits attenuation by the variable attenuator. 
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In another embodiment of the present invention, a difference 
calculator calculates a difference signal based on a difference in energy between 
each pair of adjacent frequency bins. A general tone detector generates a general 
tone detection signal based on the relative energy signal and on the difference signal. 
5 The general tone detection signal may include a vector of attenuation elements with 
each attenuation element asserted if that particular element has a corresponding 
difference signal element exceeding a difference threshold and a corresponding 
relative energy signal element less than a relative energy threshold. 

In still another embodiment of the present invention, a fax/modem 
10 detector generates a fax/modem detection signal based on any element in a subset 
of the relative energy signal exceeding a preset threshold. 

In yet another embodiment of the present invention, a select tone 
detector generates a select tone detection signal based on at least one element of the 
relative energy signal exceeding a preset threshold. The select tone detector 
15 examines relative energy signal elements corresponding to known select tone 
frequencies such as dial tones and ring tones. 

In a further embodiment of the present invention, the attenuation logic 
scales each attenuation element of at least one detection signal having a vector of 
attenuation elements. 

20 In a still further embodiment of the present invention, the attenuation 

logic implements a spreading filter across the attenuation elements of at least one of 
the detection symbols having a vector of attenuation elements. 

A method of reducing acoustic shock is also provided. The spectrum 
of an input signal is obtained. A relative energy signal is determined as a plurality 
25 of relative energy elements. Each relative energy element represents the relative 
energy in a corresponding frequency bin of the input signal spectrum. A difference 
signal is determined as a plurality of different elements. Each difference element 
represents a difference between a corresponding frequency bin value and an adjacent 
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frequency bin value. A plurality of detection signals are determined. Each 
detection signal detects the presence of a sound element in the input signal based on 
at least one of the relative energy signal and the difference signal. The detection 
signals are combined to produce an attenuation signal having an attenuation element 
5 corresponding to each frequency bin of the input signal. The input signal is 
attenuated using the attenuation signal. 

A method of reducing acoustic shock is provided. The spectrum of 
an input signal is obtained. A relative energy signal is determined as a function of 
frequency based on the spectrum. A difference signal is determined based on a 

10 change in magnitude in the spectrum as a function of frequency. A general tone 
signal is determined as a function of the relative energy signal and the difference 
signal. A fax/modem signal is determined as a function of the relative energy 
signal. A select tone signal is determined as a function of the relative energy signal. 
An attenuation signal is determined as a function of frequency based on the general 

15 tone signal, the fax/modem signal and the select tone signal. A sound signal is 
attenuated based on the attenuation signal. 

A method of preventing acoustic shock is provided. A sound signal 
is attenuated if at least one of a general tone and a fax/modem tone is detected, but 
only if at least one select tone is not detected. 

20 The above features, and other features and advantages of the present 

invention are readily apparent from the following detailed descriptions thereof when 
taken in connection with the accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 is a block diagram illustrating an acoustic shock 
25 prevention system according to an embodiment of the present invention; 

FIGURE 2 is a block diagram illustrating a shriek rejecter according 
to an embodiment of the present invention; 
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FIGURE 3 is a flow diagram illustrating acoustic chock prevention 
according to an embodiment of the present invention; and 

FIGURES 4a and 4b are input and output signals, respectively, 
illustrating acoustic shock prevention according to an embodiment of the present 
5 invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT(S) 

Referring to Figure 1 , a block diagram illustrating an acoustic shock 
prevention system according to an embodiment of the present invention is shown. 
An acoustic shock prevention system, shown generally by 20, receives input signal 

10 22 from telecommunication system 24. Telecommunication system 24 may be 
subject to a wide variety of noise sources 26 such as fax machines, modems, dual 
tone multi-frequency (DTMF) signals, special information tone (SIT) signals, cross- 
coupling from man-made and natural sources, and the like. Noise from one or more 
of sources 26 appearing on input signal 22 can create injury or discomfort to a user 

15 listening to signal 22. 

Acoustic shock prevention system 20 reduces the effects of noise 
sources 26 in input signal 22 without substantially affecting desirable components 
of signal 22 including speech, dial tones, ring tones, and the like. Acoustic shock 
prevention system 20 includes shriek rejecter 28 which generates output signal 30 
20 having substantially reduced components generated by noise sources 26. Shriek 
rejecter 28 detects high energy concentrations in small bandwidths of unwanted 
components in signal 22 and sets a variable attenuator to remove these components. 
The operation of shriek rejecter 28 is described in detail below. 

Acoustic shock prevention system 20 may also include additional 
25 components improving the quality of output signal 30. Noise canceller 32 follows 
shriek rejecter 28 to improve the signal-to-noise ratio of output signal 30. Noise 
canceller 32 may be implemented using either frequency domain or time domain 
noise cancellation routines. One suitable routine is disclosed in U.S. Patent 
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application Serial No. 10/272,921, titled "Noise Reduction In Subbanded Speech 
Signals," filed October 17, 2002, which is incorporated by reference herein in its 
entirety. 

Acoustic shock prevention system 20 may also include transient noise 
5 remover 34 for reducing transients and short duration disturbances in signal 30. 
Very short duration transients may arise from noise sources 26 or may appear as 
artifacts in signal 30 after removing unwanted components from input signal 22. 
Algorithms for removing brief transients are well known in the art of signal 
processing. 

10 Acoustic shock protection system 20 may also include compressor 36 

limiting the amplitude of output signal 30. Compressor 36 may provide soft or hard 
limiting of signal magnitude to prevent excessively loud portions of desired 
components in input signal 22 from creating acoustic shock. The output of acoustic 
shock prevention system 20 may be supplied to handset 38 or headset 40. 

15 Referring now to Figure 2, a block diagram illustrating a shriek 

rejecter according to an embodiment of the present invention is shown. As will be 
appreciated by one of ordinary skill in the art, the operations illustrated in the block 
diagrams are not necessarily sequential operations. Similarly, operations may be 
controlled by software, hardware, or a combination of both. Various elements of 

20 Figure 2 may be implemented within a microprocessor, digital signal processor, 
custom integrated circuit, discrete electronic components, and the like. The present 
invention transcends any particular implementation and the embodiment is shown 
in block diagram form for ease of illustration. 

Shriek rejecter 28 includes a frequency analyzer, shown generally by 
25 50, which determines a spectrum for each of a sequence of time windowed portions 
of input signal 22. In the example shown, input signal 22 is assumed to be an 
analog signal. However, as will be recognized by one of ordinary skill in the art, 
the present invention applies whether input signal 22 is received in analog form, 
discrete time form, digitized form, or as spectral components. 
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Anti-aliasing filter 52 removes high frequency components from input 
signal 22. Anti-aliasing filter 52 may be, for example, a ninth order low-pass filter 
with a cut-off frequency of 4 kHz. Analog-to-digital converter (ADC) 54 digitizes 
the output of anti-aliasing filter 52. In this example, a sampling rate of 8 kHz is 
5 appropriate. Add-overlap buffer 56 buffers the digitized input signal 22 for time- 
windowing and provides overlap for Hanning window 58. Various overlaps are 
possible with values of 25% and 50% typical. Hanning window 58 scales the 
amplitude of windowed input signal 22. Fast Fourier transform (FFT) 60 calculates 
a spectrum for each time windowed input signal 22. An advantage of the present 
10 invention is the ability to reduce unwanted components in input signal 22 using a 
relatively low resolution frequency transform. For example, thirty-two bins 
spanning 0 to 4 kHz are sufficient for most speech applications. 

Variable attenuator 64 attenuates spectrum signal 62 to produce 
attenuated spectrum signal 66 based on attenuation value signal 68. Preferably, 

15 attenuation value signal 68 is a vector having an attenuation element corresponding 
to each frequency bin in spectrum signal 62. Variable attenuator 64 scales each 
element of spectrum signal 62 with a corresponding value from attenuation value 
signal 68. Alternatively, attenuation value signal 68 may include a vector having 
a number of elements different than the number of frequency bins in spectrum signal 

20 62. In the simplest case, attenuation value signal 68 is a scalar value for modifying 
each bin value in spectrum signal 62 or for modifying a time-domain signal. 

Shriek rejecter 28 may include frequency-to-time converter 70 for 
converting attenuated spectrum signal 66 into output signal 30. In the example 
shown, inverse fast Fourier transform (IFFT) 72 converts attenuated spectrum signal 

25 66 into a digital time domain representation. Hanning window 74 and add-overlap 
buffer 76 reverse the effects of Hanning window 58 and add-overlap buffer 56, 
respectively. Digital-to-analog converter (DAC) 78 produces a continuous time 
signal which is filtered by anti-imaging filter 80 to remove high frequency artifacts 
produced in the analog conversion. The resulting continuous time analog signal 30 

30 represents input signal 22 with unwanted components removed. As will be 
recognized by one of ordinary skill in the art, some or all of frequency-to-time 
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converter 70 may be omitted depending upon downstream processing of output 
signal 30. For example, if noise canceller 32 operates in the frequency domain, 
shriek rejecter 28 need not contain converter 70. 

Shriek rejecter 28 calculates attenuation value signal 68 based on the 
5 magnitude components of spectrum signal 62, as indicated by bin magnitude 82. An 
energy calculator, shown generally by 84, generates relative energy signal 86 
indicating the relative energy in each frequency bin. The total energy is calculated 
by summing all bin values, as indicated by total energy 88. The relative energy per 
bin is calculated by dividing each bin value by the total energy, as indicated by 
10 relative energy/bin 90. Relative energy signal 86 is useful in shriek reduction 
because speech tends to have wide energy dispersion over frequency whereas tonal 
noise does not. 

Shriek rejecter 28 may also include bin differencer 92 for determining 
the difference between each pair of adjacent bins in the magnitude spectrum. For 
15 example, each difference signal bin D(ri) can be determined by the following 
equation: 

D(n) = F(n) - F(n + 1) 

where F(ri) is the frequency bin, N is the number of frequency bins, and n ranges 
from zero to N- 1 . The last value, D(N- 1), may be set to D(N-2), F(N- 1), zero, 
or any other suitable value. Each bin may be time averaged in averaging filter 94 
20 to produce difference signal 96. Averaging filter 94 may be implemented, for 
example, by a first-order digital filter. 

Shriek rejecter 28 includes a plurality of signal detectors, shown 
generally by 98, each of which generates a detection signal, shown generally by 
100. General tone detector 102 generates general tone detection signal 104 based 
25 on relative energy signal 86 and difference signal 96. General tone detection signal 
104 may be a vector having an element corresponding with each frequency bin. The 
element is asserted if a tone is detected within the corresponding bin. To avoid 
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asserting elements in general tone detection signal 104 during periods of speech, 
elements of general tone detection signal 104 are not asserted if spectral energy is 
widely distributed as is common in speech. Algorithmically, each element of 
general tone detection signal 104 is asserted only if the corresponding difference 
5 signal element exceeds a difference threshold and a corresponding relative energy 
signal element exceeds a relative energy threshold. 

Fax/modem detector 106 generates fax/modem detection signal 108 
based on a subset of relative energy signal 86. Fax/modem detector 106 examines 
a limited spectral range known to contain frequencies used by fax machines, 
10 modems, and other digital transmission equipment. If the energy in any frequency 
bin contained within this spectral range exceeds a preset threshold, the 
corresponding element of fax/modem detection signal is asserted. The spectral 
range over which examination occurs may be preset or may be set for different 
applications and/or regions. 

15 Select tone detector 110 generates select tone detection signal 112 

indicating the presence of one or more desirable tonal components in input signal 
22. Desirable components include one or more dial tones, ring tones, and the like. 
Select tones include any tonal signal that must be passed through in output signal 30. 
Select tone detector 110 examines specific bins in relative energy signal 86 for 

20 values above a preset threshold. For any select bin in which the threshold is 
exceeded, the corresponding component in select tone detection signal vector 1 12 
is asserted. Alternatively, select tone detection signal 112 may be a scalar value 
asserted if any select bin energy value exceeds a preset threshold. 

Shriek rejecter 28 includes attenuation logic, shown generally by 114, 
25 generating attenuation value signal 68 based on detection signals 100. Preferably, 
attenuation value signal 68 has an element controlling the attenuation of each 
frequency bin in spectrum signal 62, with an element value of " 1 " causing complete 
attenuation and an element value of "0" causing no attenuation. Attenuation logic 
114 may include attenuation calculators for setting attenuation element values 
30 between zero and one. 
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Attenuation logic 114 may include spreading filters for smoothing 
attenuation signals in frequency. For example, an attenuation vector may indicate 
the presence of a tone with a "1" as follows: [..., 0, 0, 0, 1, 0, 0, 0, ...] 
Smoothing this vector spreads attenuation values into adjacent bins resulting in, for 
5 example, the following vector: [..., 0, 0.2, 0.5, 0.8, 0.5, 0.2, 0, ...]. Spectral 
attenuation spreading reduces abrupt transitions as a function of frequency, reducing 
unnatural effects in output signal 30. Various smoothing filters are known, such as 
Hanning windows, Hamming windows, and the like. 

In the exemplary system shown in Figure 2, general tone attenuation 
10 calculation 116 receives general tone detection signal 104. The output of general 
tone attenuation calculation 116 is passed through general tone spreader 118 to 
produce general tone coefficients 120. Fax/modem attenuation calculation 122 
scales fax/modem detection signal 108. Averaging filter 124 averages scaled 
coefficients over time. Fax/modem spreader 126 frequency averages the time 
15 averaged coefficients to produce fax/modem coefficients 128. 

Attenuation combiner 130 combines coefficients from various signal 
detectors 98 to generate attenuation value signal 68. In the example shown, 
fax/modem coefficients 128 and general tone coefficients 120 are multiplied, 
element-by -element. This product vector is passed as attenuation value signal 68 if 
20 select tone detection signal 112 does not indicate the presence of desired tones in 
input signal 22. Select tone detection signal 112 may be used to deactivate 
attenuation on a per-bin basis if select tone detection signal 112 is a vector. 
Alternatively, select tone detection signal 112 may inhibit all attenuation if a scalar 
value is produced. 

25 Referring now to Figure 3, a flow diagram illustrating acoustic shock 

prevention according to an embodiment of the present invention is shown. As will 
be appreciated by one of ordinary skill in the art, the operations illustrated in the 
flow diagrams are not necessarily sequential operations. Similarly, operations may 
be controlled by software, hardware, or a combination of both. The present 
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invention transcends any particular implementation and the embodiment is shown 
in a sequential flow chart form for ease of illustration. 

A time-windowed value of an input signal is obtained, as in block 
140. A spectrum of the time-windowed value is calculated, as in block 142. A 
5 check is made to determine if any general tones are present, as in block 144. A 
check is also made to determine if any fax/modem tone is present, as in block 146. 
In a preferred embodiment, each check is made regardless of the outcome of the 
other check. If neither general tones nor fax/modem tones are present, no 
attenuation is provided. The process is repeated by obtaining the next time window. 

10 If any general tone or a fax/modem tone is detected, a check is made 

to determine if a special tone is present, as in block 148. If no special tone is 
present, the input signal is heavily attenuated to remove general tones and/or 
fax/modem tones, as in block 150. If a special tone is detected, either light or no 
attenuation is provided, as in block 152. Thus, the presence of special tones 

15 overrides attenuation that would otherwise be based on the detection of general tones 
and/or fax/modem tones. 

Referring now to Figures 4a and 4b, input and output signals, 
respectively, illustrating acoustic shock prevention of an embodiment of the present 
invention are shown. Sample input signal 22 includes region 160 containing a dial 

20 tone, region 162 containing DTMF signals, region 164 containing SIT signals, 
region 166 including a ring tone, region 168 including a fax signal, and region 170 
including a speech signal. Corresponding exemplary output signal 30 substantially 
passes the dial tone signal, indicated by region 160', the ring tone signal, indicated 
by region 166\ and the speech signal, indicated by region 170'. Output signal 30 

25 shows as substantially eliminated DTMF signals, in region 162', SIT signals, in 
region 164', and fax signals, in region 168'. 

While embodiments of the invention have been illustrated and 
described, it is not intended that these embodiments illustrate and describe all 
possible forms of the invention. Rather, the words used in the specification are 
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words of description rather than limitation, and it is understood that various changes 
may be made without departing from the spirit and scope of the invention. 
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